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A RATE OF CONVERGENCE RESULT FOR THE LARGEST 
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It has been recently shown that if X is an n x N matrix whose 
entries are i.i.d. standard complex Gaussian and h is the largest 
eigenvalue of X*X, there exist sequences m n ^ and s n ,N such that 
(h — m n ,N)/s n ,N converges in distribution to W2, the Tracy-Widom 
law appearing in the study of the Gaussian unitary ensemble. This 
probability law has a density which is known and computable. The 
cumulative distribution function of Wi is denoted Fi- 

In this paper we show that, under the assumption that n/N —> 
7 £ (0, 00), we can find a function M, continuous and nonincreasing. 
and sequences /t n ,jv and a n .M such that, for all real so, there exists 
an integer N(so,'f) for which, if (n A N) > N(so,~y), we have, with 

ln,N = (h — /in, At) / '(7n,N , 

Vs > so (n A iV) 2/3 |P(Z„,iv < s) - F 2 (s)\ < M(s ) exp(-s). 

The surprisingly good 2/3 rate and qualitative properties of the 
bounding function help explain the fact that the limiting distribution 
W2 is a good approximation to the empirical distribution of l n ,N 
in simulations, an important fact from the point of view of (e.g., 
statistical) applications. 



1. Introduction. Very important progress has been made in recent years 
in our understanding of the behavior of the eigenvalues of a large number of 
large-dimensional random matrices. Many new results concern the fluctua- 
tion of these eigenvalues: we now have convergence in distribution results, as 
opposed to maybe more classical almost-sure convergence statements. These 
new findings show great promise for applications, in particular, in Statistics. 
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Our focus in this paper is on fine convergence properties of the largest 
eigenvalue of a class of random covariance matrices. Our work owes a lot 
to the pioneering work of Tracy and Widom [20, 21, 22] and also to that 
of Johnstone [11]. Exciting recent developments in the area of random co- 
variance matrices and/or Tracy- Widom distributions can also be found, for 
instance, in [2, 3, 4, 5, 6]. 

Let us now be more specific about the question we address. In a series 
of papers by Forrester [8], Johansson [10] and Johnstone [11], it was shown 
that if X is an n x N matrix with i.i.d. standard complex Gaussian entries, 
l\, the largest eigenvalue of X*X, when properly renormalized, converges in 
distribution [when n/N ->7G (0,oo) as n — > oo] to the Tracy-Widom law 
appearing in the study of the Gaussian unitary ensemble, which we call W%. 
The paper [7] shows that this result holds even when n/N — > oo or 0. We 
recall that a random variable Z is said to have a standard complex Gaussian 
distribution [denoted A/"c(0, 1)] if Z = [Z\ + iZ-i)/\[2, where Z\ and Zi are 
i.i.d. AT(0,1). 

The chronology of the problem is the following: Forrester worked on the 
case n — N constant (see Section 3.5 in [8]), Johansson (Theorem 1.6 in 
[10]) showed that the result held when n = jN + ajv, with = 0(N 1 ^), 
and Johnstone (Section 3 and Appendix A. 6 in [11]), while being primarily 
interested in the case of real entries, relaxed the assumption to n/N — > 7 G 
(0, 00). In [7] we show that the result holds even when n/N — > or 00. 

We denote by Fi the cumulative distribution function of Wi- Following 
[11], we call matrices like X*X, that is, complex Wishart matrices with iden- 
tity covariance, complex white Wishart matrices. In random matrix theory 
terminology, we are therefore considering the Laguerre unitary ensemble 
(LUE). We remind the reader (see [20]) that if q solves 

q" (x) = xq(x) + 2q s (x) and q(x) ~ Ai(x) as x — > 00, 

then 

F2(s)=exp(—J (x — s)q 2 (x) dx^j . 

Let us recall the main result we are using from [8, 10] and [11]. Let 
n + =n+ 1/2 and ^+ = ^+1/2, 

fi n ,N = (V^+V^) 2 and a niN = (^nJ+^N^)(n- 1/2 +N- 1/2 ) 1 / 3 . 
The following was progressively shown in these papers: 

Theorem 1. If X is an n x N matrix with i.i.d. jVc(0, 1) entries and 
n/N — > 7 G (0, 00) when n — ► 00, 

h(X*X) - fi riiN w 
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To be completely correct, we need to say that the results in [8, 10, 11] 
are not stated exactly as above, but they imply their respective version of 
Theorem 1 in the aforementioned form. In particular, the different authors of 
[8, 10, 11] use different centering and scaling sequences. For instance, in [11], 
the real analog of Theorem 1 is stated with n + replaced by n — 1 and N + 
replaced by N. However, // n jv an d <J n ,N are the natural sequences from the 
point of view of the analysis presented in [11] [see Section 3 and equations 
(A.7) and (A.8) there]. 

It is clear that to use the Tracy-Widom law in Statistics or other applied 
areas, we would like to know what is the rate of convergence in Theorem 1. 
In other words, we would like to have a result similar to the Berry-Esseen 
refinement of the central limit theorem. The celebrated Berry-Esseen the- 
orem states that the error made by approximating the distribution of the 
mean of a sample of size m of i.i.d. random variables with a third moment 
by a Gaussian distribution is of size m -1 ' 2 . Here we will show that, for the 
properly renormalized largest eigenvalue of X*X, a similar measure of error 
is of size at most m -2 / 3 , where now m = min(n, N). We will also shed light 
on the involved issue of choosing good recentering and rescaling sequences, 
which significantly improve the speed of convergence result. 

We need to introduce some notation before stating the theorem. We call 
our final centering and scaling sequences /2 n ,iV an d cr n ,N-, and define them 
below. We will denote by 

li(X*X) — jl^N 
l"n,N = = 

the sequence of centered and scaled largest eigenvalues. 

Theorem 2. Let us assume that N <n, and that n/N — > 7 G [1, 00). 
We call 



ln,N 



1/2 

1/2 ' 



1 ln,N 
On,N = (1 + 7n,7Vj h 



and 



Cn-lJV &n,N-l 



Hn,N —1/9 +1/9 1/9 "+ 



1/2 1 1/2 I \ 1/2 1 1/2 

a n-l,N a n,N-l / V A i ra-l,Af°'n-l,7V Hn,N-l<?n iN -i 

Let l n> N denote the largest eigenvalue of X*X renormalized as above. 
There exists a function M , such that, for all real sq, there exists an integer 
N(sq, , j) for which we have 

Vs > s ,VN > iV(s ,7) iV 2 / 3 |P(Z n! jv < s) - F 2 (s)\ < M(s ) e - S . 
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If N > n, the theorem is valid after we switch n and N in all the displays. 
This effectively shows that it holds for 7 6 . The function M can be chosen 
to be continuous and nonincreasing. It has a finite limit at +00, and is 
unbounded at —00. 

The theorem remains valid under a wider set of technical conditions on 
fj, Ht N and <7n,7V detailed in Section 4.1. More information about M can be 
found in Section 4.2. 

To explain the importance of the centering and scaling sequences, we have 
to say that with fi ni N and a Ht N as renormalization sequences, we could not 
show that the speed of convergence was higher than iV 1 / 3 . The proof of 
Theorem 2 makes clear that /J. n ,N — the centering sequence — was probably 
in this case the quantity that was "slowing us down." 

It should be noted that the bounding function M and other bounding 
functions we will obtain in the course of the proof look like they might 
depend on the particular 7 we are choosing. We will see in Appendix A. 6 
that we can actually bound them independently of 7. 

Before we proceed, we wish to mention that Iain Johnstone, while work- 
ing on an analog of Theorem 1 for the so-called Jacobi ensemble of random 
matrices [12], found that the pointwise convergence of the corresponding 
kernel to the Airy kernel could happen at rate 2/3 (see Section 2 for more 
information on how eigenvalue problems are turned into integral operator 
ones, with explicit kernels). He then adapted his computations to the La- 
guerre case we are considering and found the same pointwise result. Though 
such pointwise techniques do not yield similar estimates for convergence in 
trace class norm, which is the delicate notion of convergence we have to use 
for the problems under investigation, his computations suggested that the 
2/3 rate in Theorem 2 was a plausible goal. We wish to point out that an 
important piece of our analysis, Lemma 2, highlights a phenomenon that 
is similar in spirit — but different in substance — to the one observed by Iain 
Johnstone in his pointwise computations. We are very grateful to him for 
sharing his insights with us. 

The plan of the article is as follows: in Section 2 we review the tech- 
niques used in [11] to obtain Theorem 1 and outline our strategy for the 
proof. We then proceed to give the proof (in Section 3) of Theorem 2 in 
two steps. We will first show a naive analysis, with nonrefined centering and 
scaling sequences, to get intermediate (and needed) results and to highlight 
the difficulty that arises. We then provide a solution that makes natural 
our choice of jl n ^N and a n ^- Finally, we discuss in Section 4 some of the 
properties of the bounding function M and present simulations to assess 
the quality of the Tracy-Widom approximation — using our centering and 
scaling sequences — across a range of dimensions. 
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Most of the technical questions that are necessary to carry out the proof 
but would obscure its explanation are relegated to the appendices. In the 
rest of the paper we will assume that n> N, n/N — > 7 G [l,oo). Since X 
and X* have the same largest singular value, this will take care of n < N 
and 7 G (0, 1] . 

2. Outline of the proof. 

2.1. Review of known results. A strength of the method used in [11] is 
that the intermediate steps lead to finite-dimensional equalities, and limits 
are taken only at the last step. This is a crucial element in our being able 
to get rates of convergence estimates. 

Recall that Johnstone, using Tracy and Widom's work [21, 22], shows 
that 



where S T is a known kernel acting here on L 2 ([s, 00)), and det is understood 
as a Fredholm determinant. As an aside, let us remark that the previous 
equation holds for arbitrary choices of centering and scaling sequences. Nat- 
urally, the resulting operator depends strongly on which choice is made, but 
for all choices such an operator exists. In particular, in [11], the operator 
S T (and, hence, the index r) depends on the sequences fi nt N and o- n ^ (see 
Section 3 and equations (A. 7) and (A. 8) in [11]). We will nevertheless use 
the notation S T even when working with different sequences fi n ,N and a n> N 
(instead of /i n) jv and o" n ,jv); in order to avoid cumbersome notation like Sf, 
but we will keep in mind that our S T is not necessarily exactly the one ap- 
pearing in [11]. We now go back to our argument. Using continuity of the 
Fredholm determinant with respect to trace class norm, Johnstone proceeds 
to show that S T — > S in trace class norm, where S, the Airy kernel, was 
shown in [20] to satisfy 



It turns out that we can even control the difference of two Fredholm 
determinants in this situation using the following result from [18], cited 
in [17] (Lemma 4, pages 323-324) and [9] (Section II. 4 and Theorem IV. 5. 2): 

Lemma 1 (Seiler-Simon). Let A and B be in S\, the family of all trace 
class operators on a separable Hilbert space 7i. If\\ ■ ||i represents trace class 
norm, we have 




<s)= det (Id - S T ) 



F 2 (s) = det(Id-5). 



det (Id - A) - det (Id -B)\<\\A- B\\ t exp(P||i + + 1). 
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2.2. Strategy for the proof. The strategy is now clear. Since all we want 
to control is the difference | det(Id — S T ) — det(Id — S)\, the previous display 
makes clear that we just need to study \\S T — S\\i. 

As a matter of fact, ||5||i(s) is, when seeing S as an operator on L 2 ([s, oo)), 
a well understood function of s and will not cause any problems. We give 
more detail in Section 4.2. Working toward the control of \\S T — S\\±, we 
recall that we know from [11] that 

and 

S = 2G 2 , 

for explicit G, G T and H T , all of them being Hilbert-Schmidt operators on 
L 2 ([s,oo)), for all s G R. 

We have the following elementary lemma: 

Lemma 2. 

2 1 1 S T — S 1 1 1 <J || G T + H T — 2G 1 1 2 1 1 G T + H T + 1G 1 1 2 + 1 1 G T — H T 1 1 2 , 
where || • H2 represents the Hilbert-Schmidt norm on L 2 ([s, 00)). 

Since the tools involved in the proof of this lemma are somewhat different 
from the ones involved in other proofs, this is proved in Appendix A. 4. 
So to prove Theorem 2, it will be sufficient to show the following: 

Lemma 3. Assume that n/N ->76 [l,oo). Then we have Vso £R, 
3iV(so,7),Vs>so,VAf >iV(s ,7), 

(PI) N 2 / 3 \\G T + H T - 2G\\ 2 < C(s ) exp(-s/2), 

(P2) AT 1/3 ||G r - G|| 2 < C(s ) exp(-s/2), 

(P3) N^WHt - G\\ 2 < C(s )exp(-s/2), 

where C is a continuous, nonincreasing function. 

The rest of the article will be devoted to proving these estimates. Let 
us now make a few remarks. The first one is structural: our proof makes 
fundamental use of the structure of S T , that is, the fact that it is the sum 
of the product of two Hilbert-Schmidt operators. It also heavily relies on 
the fact that those operators [seen as acting on L 2 ([s,oo))] are kernel oper- 
ators with kernels of the form K s {x,y) = K{x + y — s), which reduces our 
problem to studying certain functions, as opposed to the potentially more 
complicated objects that are general operators. Other problems having this 
same structure could be attacked by the same approach. 
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Closer to the problem we actually work on, let us mention that we have 
the choice of which rate to increase from 1/3 to 2/3 in Lemma 3. In our 
decomposition of S T — S, G T + H T — 2G plays a more important role (in 
terms of rates) than G T — G or H T — G do. This is why an important effort 
will be devoted to showing that we can get rate 2/3 for the convergence of 
G T + H T — 2G in the Hilbert-Schmidt norm to 0. We will show later that 
the "natural" centering and scaling (natural for the perturbation analysis) 
leads to rate 2/3 for \\G T — G\\2, but rate 1/3 for the two other elements 
of the previous display. Our "optimized" centering reflects the fact that we 
had to find a trade-off between an optimal centering for G T and an optimal 
centering for H T . This is partly why the "optimized" sequences look so 
involved. 

In other respects, the operators mentioned here are kernel operators whose 
kernels are well known and understood. Because these kernels can be related 
to the solution of a perturbed Airy equation, our task is now essentially 
reduced to studying in detail the properties of a solution of a certain dif- 
ferential equation. This will become more clear in the course of the article. 
Note, finally, that Lemma 3 only deals with Hilbert-Schmidt norms, which 
are considerably simpler to manipulate and bound than trace class norms. 

2.2.1. Technical details on the elements of the problem. Because of ele- 
mentary results of linear algebra, namely, the fact that X and X* have the 
same singular values (except for the multiplicity of those at 0), we can and 
do assume that n > N in what follows. The notation we use are the ones 
found in [11]. 

Let us introduce ajy = n — N , and 



where is the fcth Laguerre polynomial associated with a. Then calling 

ajy = V Nn, ip and ifj are defined as 



Now let Ai denote the Airy function, <p T (s) = CT nt Np(P<n,N + sa n ^) and sim- 
ilarly tp T (s) = 5 n ,N'ip{ii>n,N + sa n ^)- Finally, we have, when considering G T , 
H T and G as operators on L 2 ([s, oo)), 

G T {x,y) = tp T (x + y-s), G(x,y) = 2~ 1/2 Ai(x + y-s), 
H T (x,y) = tp T (x + y- s). 
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The tp and ip we introduce here are the same as in [11], but we have used 
some elementary properties of Laguerre polynomials (see [19], page 102) to 
simplify their expression. All the details are given in Appendix A. 5, where 
one will also find a remark explaining why the case a at = does not pose a 
problem. This makes our rate work much simpler later, since we will have 
to deal with only two pieces (instead of four if we had kept the original 
representation) when adjusting the centering and scaling sequences to get 
rate 2/3. 

2.2.2. Formulating the problem. In his study, Johnstone [11] introduced 
an "intermediate" function that proved to be a powerful tool in the study 
of the problem. We introduce, following his idea, the very closely related 
function 

(I) F n>N (z) = (-l)^< i 1 /y / An/^!z^ +1 )/ 2 e- z / 2 L7(z). 

This is valuable as the study of -F n ,AT as N gets large turns out to be the 
study of a perturbation of the Airy equation, and this has been the subject 
of extensive investigations. See, for instance, [15], Chapter 11. In special 
functions parlance, we have 

F n , N (z) = a~^ 2 -==W KN ,x N (z), 
ynlJSi 

where W KNi \ N stands for the Whittaker function with parameters kjv = 
N + (a N + l)/2 and X N = a N /2. 

In the situation we are investigating here, namely, n/N tends to a finite 
limit as n and N get large, [11] introduces the investigations of Olver [15] 
concerning the so-called differential equations with one turning point to 
study the properties of W KNt \ N . In the case n/N — ► oo, we would have to 
take care of the fact that the turning points coalesce (see [7]) to do the 
same. Note that we cannot really apply Olver's results in [15] directly, since 
the differential equation we consider depends on two parameters. In Ap- 
pendix A. 3, we explain why they apply nonetheless. This was implicit in 

[II] , but we make it explicit for the sake of completeness. 

We now need to relate F n N to ip(-;a) and tp T . We remark that 

ip N {x;a N ) = {-l) N a l J 2 N x~ 1/2 F n ^ N (x), 
, s [on 1/2 „ , s , 

<PW = ^— CJ n / _ liAr F n „i iA r(x)/x 

and 

, / 1/2 

1 f Va N o- n _ 1N a ntN \ Vn-i,N 



tp T (x) = —7=[ ' )F n _ 1 N (n nN + a n:N x)[ 

V2 V Mra-l,JV / V 
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We have similar expressions for ip T , if we replace (n — 1,N) by (n,N — 1) in 
the previous expression. 

So the problem essentially reduces to having a good understanding of 
Fn-l,N an d the other quantities written in the equation defining ip T . 

2.2.3. What do we need to control in order to get the rates? Our objec- 
tive is to get the rates mentioned in Lemma 3. In order to do this, we will 
prove the following fact: 



Fact 2.2.1. Suppose n/N — > 7 e [1, 00). Let us call G{x) = M{x)/y/2. 
We have V s 3 iY(s ,7) V z € [s , 00), ViV > N(s ,j), 



for a function C that is continuous and nonincreasing on R. 

The reason this implies Lemma 3 is that all the kernels — when we see 
the operators as acting on L 2 ([s,oo)) — are of the form K s (x,y) = K(x + 
y — s), and we are dealing with Hilbert-Schmidt norms. In somewhat more 
detail, let us consider a Hilbert-Schmidt operator A [on L 2 ([s,oo))] with 
kernel K s (x, y). Let us assume that K s (x, y) = K(x + y — s) and n^|if(z)| < 
Xexp(— z/2), for a certain \ £ Let us call || • \i{s) the Hilbert-Schmidt 
norm on L 2 ([s, 00)). It is well known (see [16], Theorem VI. 23, page 210) that 
ll^lli^) = II[s oo) 2 \^s(x,y)\ 2 dxdy. So in our situation, since \K s (x,y)\ 2 = 
\K(x + y — s)\ 2 < n~ 2f3 x 2 exp(— (x + y — s)), we immediately have ||A||2(s) < 
n~^x ex P( — s /2)- So Fact 2.2.1 clearly implies Lemma 3. 

3. Proof. Before we proceed to giving the proof, we remind the reader 
of the analysis carried out in [11]: F n ^ (actually a closely related quantity) 
was analyzed using the Liouville-Green approximation, which we detail now. 
Pushing further this type of analysis will give us explicit bounds on how far 
F nt N deviates from the Airy function and will be the centerpiece of the 
analysis. A remark about notation: they are "naturally" heavy and we warn 
the reader that some abuse will take place, as we will not always use both 
indices "n" and U N" even when it is clear that the function or sequence 
depends on both of them. But they will always be present when we give 
rigorous arguments and there might be a doubt about the quantities we are 
talking about. 

This section is organized as follows: we first recall in detail the method 
that was used in [11] to show convergence of S T to S. We then prove an 



(2) 
(3) 
(4) 



A^ 2 / 3 |^ T (z) + W(^) - 2G(z)\ < C(s ) exp(-z/2) 
JV^VCs) - G(z)\ < C(s ) exp(-z/2) 
NWtyriz) - G(z)\ < C(s ) exp(-z/2) 
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intermediary result concerning the rate of convergence (2/3) of F n ^(n n ^ + 
&n,Nx)(nn,N / (fJ-n,N + &n,Nx)) to Ai(x) on semi-infinite intervals of the form 
[so,oo). Then, we will show (in Section 3.4) that we can only guarantee a 
rate of 1/3 if we slightly perturb the centering and scaling. This will show 
equations (P2) and (P3) of Lemma 3. Finally, we will prove equation (PI) 
of Lemma 3 in Section 3.5. 



3.1. Liouville-Green approximation for Whittaker functions. We recall 
that if we call a N = n-N, w N {x) = x^ N+1 ^ 2 e- x/2 L% N (x), we have 

ran.-** i \ d2yjN / 1 % , ^-1/41 

(Whittaker) — — = {- h 9 >w N , 

dx z [_ 4 x x z J 

with k n = N + -*dhl an d \ N = Zf. 

Following [11], after changing variables to £ = x/kn, we get 

d2lVN { 2 f( C \ i icw 



ale 



with 



f(0 = — ^ — and 9(0 =-^2- 

Here 6 = 2- J A - ujI and 6 = 2 + JA-uj 2 with u; N = ^ = 2- " 



Under our assumptions about n and N, iof^ G [0,4 — 5], with 5 > 0. This 
prevents the turning points £i and £2 from coalescing, a crucial point in 
what follows. Note that, to be precise, we should write that /(£) is really 
f(uN ■,£,)■, as the family of functions / is indexed by ojn- 

Now, following [15], Chapter 11, [11] introduces the change of variable 
(known as the Liouville-Green or WKB method) 



C 3/2 = f f l/2 (t)dt. 



Defining a new dependent variable W by w = (d£/d£) l / 2 W , one gets the 
new differential equation 

d 2 W 

-J^ = W N C + V(UN,()}W, 

where we insist on the fact that v is also a function of u, because / was. 
We also note that v has a somewhat explicit expression, to which we come 
back in Appendix A. 3, so the reader is referred there for more details. Then, 
if we denote by / = f /C, the recessive solution of the equation (Whittaker) 
satisfies (see [11], equation (5.16)) 
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where 

| £2 («iv,oi<M(4 /3 c)E- i (4 /3 o 



cxp 



M and E are the modulus and weight functions introduced in [15] , Chap- 
ter 11, pages 394-397. Also, Ao = 1.04, and F{uS) is bounded when to 2 is in 
[0,4 - 5] (see Appendix A.3). The sequence c Kjv has an explicit expression, 
and a little algebra (see also [11], equation (5.20)) leads to 

/ \ 1/6 

(5) F n , N (x) = Hv r 1/4 (£){Ai(4 /3 C) + e 2 ( KN , Oh 

with x = kat£. If we denote by n + = n + 1/2 and N + = N + 1/2, 

2 2vrexp[-(n + + N + )]nl + N+ + 
r N = 



N\n\ 

with ttv being nonnegative (as an aside, note that our r^ corresponds to 
[ll]'s (— l) N rj\f). We will occasionally use only the index N instead of n,N. 
When needed, we will make precise the n we are dealing with. 

3.2. Gathering the different elements: a useful intermediate result. This 
subsection sets up the core of the technical work of the article. We intro- 
duce a class of functions (containing, of course, ip T and ip T ) and study in 
Section 3.3 how its members deviate from the Airy function; there, we are 
going to choose the equivalent of an optimal centering and scaling for the 
(n, N) pair and show that we can obtain rate 2/3 [recall that ip corresponds 
to (n — 1, N) and tp corresponds to (n, N — 1)]. At the end of Section 3.3 we 
will have a much finer understanding of the issues involved, will know what 
is easy and hard to deal with and will have all the technical elements needed 
to tackle the proof of equation (PI). The conclusion of our work is that one 
could actually have rate 2/3 in (P2) and (3). The proofs will also imply that 
getting the rate 1/3 is "easy," and so we should reserve the hard-earned 
2/3 rate to something that is harder to deal with, namely, equation (PI). 
Section 3.3 will give us all the elements needed to do this, the details being 
taken care of in Section 3.5. We now carry out in detail the analysis. 

3.2.1. About the deviation of F n ^ and related functions from Ai. Let 
us note that <p T (s) and ip T (s) have the same functional form. They can be 
written 



T2 a3 < kFhk{x{s A^))' 



V2 

where j and k are integer indexes, Fj k stands for the function introduced 
in equation (1), fij^ is the sequence mentioned in the Introduction, x(s) = 
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m j,k + s j,kS and 0£j t k, m j,k and Sj^ are sequences of numbers independent 
of s. Getting a lower bound on the rate of convergence in our original problem 
essentially reduces to studying how members of this class of functions deviate 
from the Airy function. A central element in doing this is the expression for 
F Ut N mentioned in equation (5). We will decompose it into four blocks: rjy, 

(W<7v) 1/6 .r 1/4 , Ai(4 /3 C) and e 2 (k n ,0- 

The easiest part to deal with is £2, as we will see, it is controlled by 1/kjv, 
with K at ~ (1 + 7)A r /2, and our aim is to get only speed TV 2 / 3 . So it is not 
going to cause us any problems (we will show that we control the other part 
of the expression defining £2). 

Given the expression immediately following equation (5), has easy to 
analyze asymptotics, and as is shown in equation (A.l), we have [using the 
notation o(n _1 ,./V _1 ) to state that a quantity is an o of both n and N, 
independently of how they mutually go to 00] 



r N = l + 



1 


ri 


1 " 








- + 




+ o( 


- ~) 


48 


n 


n_ 


v n' NJ 



Once again, it is not a troublesome quantity — given our objective of a 2/3 
rate — since \rj\i — 1| goes to like 1/N. Nevertheless, it will sometimes sim- 
plify our work to keep it in the analyses. 

Of central interest will be (ktv/ct 3 tv) 1 ^ 6 / -1 ^ 4 - We show in equation (A. 3), 
that, if £ = £2 + £jv> £jv small with respect to 1, there exists a sequence 
tending to a (finite) limit, denoted t]n, such that 

(«v<iv) 1/6 r 1/4 (£) = 1 - h^N + o(4). 

Since en will be as big as 0(N^ 1 ^ 2 ), we will have to be more careful about 
this part in the final steps of the analysis. 

9/0 

Finally, the most problematic part will turn out to be Ai(« A > £). We recall 
that it was shown in [11] that Kj(j C — ► s for xn{s) = /x n> v + &n,NS- Here we 

2 /3 

will need to go one step further in the asymptotic expansion of £. For 
ease of exposition, we will first focus on xn(s) = (J, nt N + ^n.TVS, and will get 
to xn(s) = p> n ,N + &n,NS — the "optimal" centering and scaling for tp T + ^ T — 
only after we understand what goes "wrong" in terms of rates with xn(s). 
We will focus in the next several subsections on 




since the proof of Lemma 3, via Fact 2.2.1, will rest on our ability to analyze 
quantities of the type 

A n ,N(xN(s)) = \a ntN 9 n , N (x N (s)) - Ai(s)|. 
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We split A ni 7v into two parts: 

A njN (x N (s)) < a ntN \9 ntN (x N (s)) - r7vAi(s)| + \r N a n , N - l||Ai(s)| 
< a ntN A J ntN (x N (s)) + A^ A (s) 

with 

A l n iN ( x N(s)) = \O n ,N(xN(s)) ~ r N Ai(s)\ 

and 

A n,N( s ) = \ r NO!n,N ~ l||Ai(s)|. 

We will assume from now on that a nt N = 1 + 0(1/N). Of course, we will 
verify that the sequence we eventually use has this property. We are going to 
show that, for s 6 [so,oo), and C a function that might change from display 
to display but will always be continuous and nonincreasing, 

(11) N 2 / 3 Al N (x N (s)) < C(s ) exp(-a/2), 

(12) NAl N (s)<C(s )eM-s/2). 

3.2.2. Rationale for the rates. Before delving into the details of find- 
ing the bounds, we want to explain what is the rationale behind the rates 
corresponding to the different elements of the upper bounding sum. Recall 
that we are now working with a particular centering and scaling, namely, 

X N (s) = Hn,N + 0- ni NS: 

• A^ N : essentially what happens is that, with this centering and scaling, 

k 2 ^ 3 Cn(xn(s)) converges to s at speed iV 2//3 , and we are able to deal with 
the rest of the elements at this "speed." 

• A^ 1 N : here, of course, only the rate of convergence of rjya ni 7v to 1 matters. 
The proof of (12) is an immediate consequence of the estimate we gave for 
r^v and of our assumption that a n< N = 1 + 0(1/N), so we do not need to 
worry about it. 

2 /3 

3.3. Proof of (II). Given the dynamics of C(xn(s)) (explored in 
more detail in Appendix A. 2. 3), we can achieve our objective by using the 
coarse bound 

N 2 / 3 A l njN (x N (s)) < N 2 / 3 \6 niN (x N (s))-r N Ai(K% 3 ((x N (s)))\ 

+ N 2 / 3 r N \Ai(4 3 ((x N (s)))-Ai(s)\. 
We will use the following notation: 

B n , N (x N (s)) = \9 n>N (x N (s)) - r N Ai(K 2 ] { 3 C(x N (s)))\, 
D njN (x N (s)) = Ai(k^ ((x N (s))) - Ai(s). 
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Hence, the previous inequality is just 



N 2 '^\ N < N 2 / 3 B n>N + r N N 2 / 3 \D, 



n,N\, 



where we have dropped the argument xn(s) in the interest of clarity. We 
will continue to do so when there is no ambiguity about the argument we 
are manipulating. 

It will turn out that B n ^ is more "robust" to recentering and rescaling 
than D n ^: when modifying slightly the centering and scaling [i.e., going 
from xjv(s) to xn(s)], we will be able to achieve the same rate — 2/3 — for 
B n ,N(xN(s)) but not for D njN (x N (s)). 

We are going to split [so,oo) into three varying intervals, namely, [so,si], 
[si, A^ 1//6 si] and [iV 1 / 6 ^!, oo); on the first two of these intervals, we will prin- 
cipally use our understanding of £(£)• This splitting will turn out to be 
natural because we will need to be precise on [so, A rl / 6 si], and will rely on 
Taylor expansions to carry out the work. TV^si is small enough at the rel- 
evant scale that they will be uniformly valid on [so, N^' 6 si\. On the other 
hand, given the speed of decay of the Airy function, we will use coarse decay 
bounds for this special function on the last interval, where s is necessarily 
large, because N is large enough. 

Before we delve into the details, let us jump ahead and explain what 
controls the speed of decay to zero of B n< N and D n ^. 

For B n< N(xN(s)), the key quantity is going to be 



which, as we will see, is of "order" en(s)(= £(s) — £2 = x n{s)/^n — £2)- 
So B n: N is also of this order, and the announced rate will hold because, 

2 /3 

roughly, en{s)/s = 0(1). This is more a take-away, heuristic message 
than a precise mathematical statement, but with this in mind, we can make 
everything rigorous. 

For D n ^]\f(x]\f(s)), we will see that what really matters is the rate of con- 
vergence of 



on intervals where £n(s) is "under control." The decay to zero of this quan- 
tity is what really hurts us in terms of rate when we cannot use (for the 
trade-off reasons described in Section 2.2) an optimal centering and scaling 
sequence for D n ,jv, that is, when we have to work with D n ,iv(5/v(s)) instead 
of D nt N(xN(s)). We will see in Section 3.5 how we can nonetheless overcome 
this difficulty. 




K N C( x N{s)) ~ S 
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3.3.1. Bounds for N 2 ^ 3 B n ^ . Recall that 9 n ,N(z) = F n ^{z){^ n ,N / z) ■ So, 
using equation (5), we get 

B n>N (x N (s)) 

9 / Q 

= \F ntN (x N (s))(fi n>N /x N (s)) -r N Ai(K^ ()\ 

UN 



a 



n,N 



■1^- 

K N 



^r^m^N/xNis)) - i|iAi(4 /3 oi 
E - 1( 4' 3 c)M,4' s c)(^)™(^) 1/6 r^(o 



Note that the last line [without 0(^-)] was bounded by C(so)e s on 



[sq,oo) in [11], (A. 8), so we just have to concentrate on the first part of the 



sum. 



To do this, we split [so,oo) into [so,si], [si , iV 1 / 6 ^] and [N^-^si, oo). We 
treat the problems in decreasing order of difficulty. 

Case s 6 2i jy = [s±, A 1//6 si]. First a note on s±: it is chosen as in [11], 

(A. 8), that is, it is such that, for s > si, 2/3k^ £ > s. As we explain in 
Appendix A. 6. 4, we can choose the same s± for all values of 7, the limit of 
n/N. Also, we can assume that s\ > 1, which guarantees that Ai is positive 
on [si,oo), that Ai' is negative and increasing on this interval, and that 

. , , exp(-2x 3 / 2 /3) 
*(*)< % J , 

using properties of the Airy function cited in [15], pages 393-394. 

Now recall that we chose xjy(s) = /i n ,AT + Cn.iV'S; in equation (A. 3), it 
means that £n(s) = s<7 n) jv//ejv, since Kjv^2 — H-n.N- As an aside, let us point 
out that in what follows, we will often drop the dependence on s of £n(s) 
to simplify the notation. We will also use the notation 7] jy for a specified 
sequence of real numbers having a finite limit (when 7 is given) and defined 
in Appendix A. 2. With this in mind, we have, through equation (A. 3), 

<at/ x N {s) 



1 - \VNSN + 0(8%)) + 0(4)) 

5 /V Hn,N / 



= (l + rj N £ N + 0(e N )), 

as kn I ' Hn,N X 1- Once again, fj^ has a finite limit as ./V — > 00. Now on J^jv, 
£tv = O^ -1 / 2 ). Note that this estimate is also valid if we use xn(s) = 
Mn,JV + &n,NS, under certain conditions on jl nj N arid a n> N. The ones we will 
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eventually use easily satisfy those. Also, in what follows we use "coarse" O, 
that is, what appears in them may not be the optimal quantity, but it is 
definitely good enough for what we need. We conclude that 



V S G 1 1 A? 



K N 



a 



n,N 



1/6 



r 1/4 (oowzjv(s)) = i + vnsn + o(iv- 4 / 5 ). 



Hence, there exists x(7)> a function independent of N and n and, hence, 
a constant at 7 fixed such that 



AT 2 / 3 



K N 



a 



n,N 



1/6 /- 1/4 (0(*WM*))-i 



when s G ii JV) an d N is large enough. As explained in Appendix A. 6, x(l) 
can be chosen independently of 7. Hence, we will call it simply which 
will represent a generic constant, which can change from display to display. 
[Note that iV(so,7), the integer after which these estimates are valid, might 
depend on 7.] 

Now since s > s±, 2/3ktvC 3 / 2 > s, and therefore, 

on Ji >JV , |Ai(K^ 3 C)| < xexp(-s). 
Combining the two previous results, we get, for s G I\ jv, 



K N 



n,N 



\ V 6 

J r 1/4 (e)(/in,AT/^(s))-l 



l Al (^ 3 C)l < Xsexp(-s) 



< xexp(-s/2). 

Case s G I2,n = [A^ 1 / 6 si, 00). Here we can act heavy-handedly: the fast 
decay of the Airy function alone will suffice for our purposes. As a matter 
of fact, we use the very coarse bound 



K N 



a 



n,N 
< 



1/6 



r l '\0^n,N/x N {s))-l 



K N 



a 



n,N 



1/6 . 



|Ai(4 /3 oi 



r 1/4 (6(^^/x/v(s))Ai(4 /3 c) 



+ |Ai(4 /3 oi- 



Now, since Ai < ME 1 , using (A. 8) in [11], it follows that 
1/6 



f^N_\ /-V4(£)(^/ x „( a ))Ai(4 3 C) 



< xexp(-s). 



Since s > A^ 1 / 6 si, we have s > 4/31og(iV), which implies that 

N 2/3 exp(-s) <exp(-s/2). 
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Similarly (using 2/3ktvC 3 ^ 2 — s > an d the bound on Ai(x) given in [15], 
page 394), |Ai(^ 3 C)| < exp(-2K7vC 3/2 /3) < e~ s < N- 2 / 3 e~ s / 2 . We can there- 
fore conclude that 



iV 2 / 3 



rjv 



a 



n,JV 



1/6 



/" 1/4 (0(Vn,N/x N (s)) - 1 |Ai(4 /3 C)| < X (e~ s/2 + e- 

< xexp(-s/2). 



Case s £ Jo,jv = [so 5 si] . The arguments we used on apply and show- 
that 



AT 2 / 3 



K N 



a 



n,N 



1/6 

r 1/4 (OMs)/Vn,N) 



1 /s 

For s G io.JVj the sequence Kjy C(x,/v(s)) is bounded, because, at n, iV 
fixed, (" is an increasing function of s, and it is uniformly bounded in N 

since C{ x n{so)) —> $o and C{%n{si)) — ► s±. So we have, for N large 
enough, 



-i 



1 



<x(Hvi). 



VsG J, 



0,7V 



and therefore, 



VsG Iq,N 



-2(|s | V1)<4 /3 C(xat(s))<2si, 



|Ai(4 /3 C)|< sup |Ai(s)|. 

s6[-2(|a |Vl),2si] 



A point on notation: in several displays, we will use quantities like — 2|so|. 
Our arguments are correct if sq ^ 0, but we can replace everywhere |so| 
hy | so| V l and they are valid whether or not sq = 0. Aware of this, we 
nevertheless choose to write |so| instead of |so| VI to avoid cumbersome 
notation, and given the fact that this is really a minor detail. Going back to 
the analysis, we obtain 



AT 2 / 3 



rr 



n,N 



1/6 



r 1/4 (£)(*7v( s )//^)- 



l 



|Ai(4 /3 oi 



<X(MV1) sup |Ai(s)| <C(s )exp(-s/2). 

se[-2|s |,2si] 

We can choose for C in the previous display, up to a constant, the func- 
tion C(s ) = {sup sG[ _2| B0 | )2ai ] |Ai(s)|sup se[ _ 2 | S0 | )2si ][exp(s/2)(|s| VI)]}. This 
is of course a continuous and nonincreasing function of sq. C (and the afore- 
mentioned constant) can be chosen independently of 7 through arguments 
similar to the ones mentioned when the problem of the dependence of x on 
7 arose on Iin- 



18 N. EL KAROUI 

We have therefore shown that if n/N — > 7 £ [1, 00), for all sq we can find 
iV(so,7) such that for s > so and N > N(so,j), 

N 2 / 3 B nyN (x N (s)) < C{s Q )e- s ' 2 . 

C is a continuous nonincreasing function that is constant on [si,oo) and 
independent of 7. [Actually, the explicit (up to a constant) expression for C 
given above shows that we could replace C by a constant, but it is of little 
use given what follows.] 

3.3.2. Bounding N 2 / 3 \D n ^(xN(s))\ • Here we are of course going to be 

2 /s 

relying heavily on first-order asymptotics for (,(xn(s)), given in equa- 
tion (A. 4) and the fact that |Ai'(s)| is decreasing on M + (Ai' is increasing 
because Ai" is positive on M + according to the Airy equation, and we know 
that Ai' is negative on We use the same decomposition of [sq, 00) as in 
the B n ^ case. 

Case s G I\^n ■ We have from equation (A. 4) 

i^n 3 C( x n(s)) = £NKN (l + -£ N Vn + 0{e 2 N ) ) . 

o- n ,N V 5 / 

Our choice of centering and scaling gives en = £jv(s) = sCn,Af/^7V- So, when 

9/0 

N is large enough, mm(K^ ((xn{s)), s) > s/2. Therefore, using the mean- 
value theorem and the fact that |Ai'| is decreasing, 



|Ai(4 /3 C) - Ai( S )| < |Ai'(s/2)||4 /3 C -s\< X ( 7 )|Ai'( S /2)| S 



2°~n,N 



\^{s/2)\s 2 < °' Voim ' <^ s, \ 



K N 

Now since s > s\ > 1, using the first formula on page 394 of [15], we get 

^?2V4 

which shows that indeed N 2 /^\D n ^\ < x^~ s ^ 2 on I\,N- (x can De chosen 
independently of 7, as shown by arguments similar to those made in Ap- 
pendix A. 6.) 

Case s £ h,N- hi this case, we can be as rough as we were on the corre- 
sponding interval for B n ^: 

N 2 /*\D n>N \ < iV 2 / 3 (|Ai(4 /3 C)| + |Ai( S )|) < X e~ s/2 - 
Case s £ Io t N- Here, as we showed in the corresponding case for B n ^, 

C stays uniformly bounded when N goes to 00. Moreover, the arguments 
given for J^jv still hold, and we have 

iV 2 / 3 |Ai(4 /3 C) - Ai(s)| < X s 2 sup |Ai'(a)| < C{s Q )er s / 2 . 

[-2|s |,2si] 
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C can be chosen — up to a constant — to be the continuous nonincreasing 
function C(s ) = {sup se[ _ 2M]2si] | Ai'(s)| sup sg[ _ 2Nj2si] [exp(s/2)(s 2 V 1)]}. 

Note that this C tends to oo as so — > — oo, and does it like | so 1 1//4 , using 
well-known properties of Ai'(s). The constants x (which look like they are 
functions of 7) can be shown to be independent of 7, through estimates 
given in Appendix A. 6. 

We have shown the inequality (II). 

3.4. About (P2), (P3) and (PI). We conclude from the previous sub- 
section that if we chose fi n ,N = Mn-i,iV and & n ,N = °~n-i,N, we would have 
A 2 / 3 ||G T - G|| 2 < C(s )exp(-s/2). The problems for (P3) and (PI) are es- 
sentially the same, so we will just focus on (P3). We have to understand how 
a nonoptimal centering affects the rate of convergence of H T to G. What we 
will see in Section 3.4.2 is that using n n -i.N and &n-i,N 011 H T (instead 
of the "optimal" fJ, n ,N-i and o" ni 7v_i) prevents us from being able to show 
convergence of H T to G at a speed faster than A 1 / 3 . Since we will need to 
compromise between H T and G T (because what is optimal for one is not 
optimal for the other), "favoring" G T over H T (or vice- versa) turns out to 
be a bad choice for the global problem and we need to investigate a new 
problem we now set up and will solve in Section 3.5. 

Let /2 n> Ar and d Ui N be, respectively, a centering and scaling sequence. 
Let us further assume that fl n> N — Mn-i,JV = 0(1) and ((T n ,N /o~n-i,N — 1) = 
0(iV _1 ). We will show that, in this situation, (P2) holds. Later, we will show 
that we can find a pair (pLn,N,&n,N) such that (PI) holds and at this point 
Theorem 2 will be proven. 

So we are now dealing with xn(s) = fin,N + &n,NS- We have 

En-1,N = (An,Ar ~ Mn-l,Jv)/ «n-l,JV + s &n,N / K>n-1,N- 

Hence, when we work with K n -l,N £ n-\,N I >cr n-l,Ni the first term is already 
0(iV -1 / 3 ). This is fundamentally what "harms" the rate when we do not 
choose an "optimal" centering and scaling sequence. 

We now look in more detail at B n _i^(xN{s)) and D n _i^(xN(s)). We 

1 /2 

will show in Appendix A. 1.4, that a n -i^ = y / ajvc re _i N&n,N / l*n-l,N g° es 
to 1 at rate 1 [i.e., it is 1 + ©(A" 1 )] and that it therefore satisfies the 
assumptions put forward in Section 3.2.2. 

3.4.1. £? n _i : jv(xjv(s)) situation. As we will soon see, this part is not 
really problematic: we keep the 2/3 rate even when choosing a — reasonable — 
nonoptimal sequence. 

Note that / 2 ,Af is not a problem, as the upper bounding relied on the speed 

of the decay of the Airy function, and using (A. 4), we have k^-i jvC(^JVW) — 
0.5 + 3s/4, if s > si and n, N large enough. 
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Now on Iin and Iq^n, we are still fine: using the fact that £ n _i 5 Ar = 

(fin,N — Pn-1,N) / Kn-1,N + S<T n ,N / Kn-1,N and (fi n ,N ~ Mn-l.Jv) / '«n-l,iV = 0(iV _1 ), 

the analysis carried above still holds (we give more details on this in Sec- 
tion 3.5), and we have, VsG h,N, 

r n -i,N -3 / 7 (6 - / \ 

n 2 {n-l} &n,NS . /-i/ A t-4/5^ 

= 1--^ ; £ n -l,AT ! hO(iV * /0 ) 

fJ"n-l,N 

= l + V { N n ~ 1} e n . 1 , N + 0(N-^). 
Using the same ideas as above, we conclude that 

N 2 / 3 B n „ ltN (x N (s)) < C(s )exp(- S /2), 
where, once again, C can be chosen to be continuous and nonincreasing. 

3.4.2. D„_i j 7v(^A r (s)) situation. The problem is essentially the same as 
it was in Section 3.3.2: dealing now with e n _i at, we get, by simply applying 
(A.4) 

K 1%C(xn(s)) = s + 0(N~ 1 / 3 ). 

We can then apply the mean-value theorem to get exponential bounds, but 
the problem remains: the speed cannot be shown to be faster than iV 1 / 3 . 

This completes the proof of (P2), since it shows that (3) holds true. It 
also shows that (P3) holds. 

3.5. Better centering and scaling. We now turn to the problem of finding 
centering and scaling sequences such that (PI) holds and the assumptions 
we made (about the relationship between the sequences £t n> N and a n ^ on 
one hand, and fJ, n -i t N and a n -± t N on the other) in the previous subsections 
are valid. What we need to do is relatively clear: we need to compensate the 
iV -1 / 3 deviation of G T from G by the TV -1 / 3 deviation of H T from G to get 
a higher rate of convergence for G T + H T to 2G. The discussions above show 
that the only region that is problematic is Jo, at U Ii t N, and the problems 
arise only for D n _\^ and D n jj-\. 

So let us now focus on 

An,jv(£/v( s )) = \Vt{xn{s)) + i) T {xN(s)) - \/2Ai(s)|. 

We have 

V2Al N (x N (s)) 
(6) < a n -i jN \B n _i, N (x N (s))\ + a niN -i\B rLiN _i(x N (s))\ 

+ \a n -i,Nr n -i,ND n -i tN (x N (s)) + a n> N-ir n ,N~iD n> N-i{x N (s))\ 
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and our aim is to show that (with the same quantifiers as usual) 

(13) N 2 / 3 Al N (x N (s))<C{s )exp{-s/2) on [s ,oo). 

Once again, C will be shown to be a continuous and nonincreasing function. 

Neither B ti -i ! n(xn(s)) nor B Ui n-i(xn(s)) will cause problems in terms 
of rates (as shown by the arguments given in Section 3.4.1), so we just need 
to focus on the first term of (6). Before we proceed to giving the needed 
explanations, let us introduce the notation: 

e N = — '■ '— H '■ — s, 

K n -1,N «n-l,JV 

jj"n,N — Mn.JV-l , &n,N 

SN-1 = 1 S, 

K n,N-l K n,N-l 

s a fin.N — fJ"n-l,N 

O n -l,N = 



N-l 



«Vi-l,iV 
A ^n,N — Mn,JV-l 



We have xn{s) / K n -i,N = £2 + ^A r and similarly for ejv-i- From now 
on we make the assumptions that fi n ,N — Mn-l,jv = 0(1), jl n ,N — Mn,JV-l = 
0(1) and that <J n ,N / &n-i,N — ^-,S , n,N/^n,N-i ~ 1 = 0(N~ 1 ,n~ 1 ), which we 
will show (in Appendices A. 1.2 and A. 1.3) hold for our eventual choice of 
centering and scaling. Finally, we note that since r n __uv and r n ^^\ are 
1 + 0(1 /N), they will not affect the discussion that follows, and so we drop 
them for the sake of simplicity. 

The only real problem is with 

&n,N(xN(s)) = \a n -i,ND n - ltN (x N (s)) + a n>N -iD nj N _i (x N (s)) | . 

3.5.1. Analysis of A^jy. Let us first remark that the coarse approach 
explained in detail above showed that on we have A^ 2//3 D n „ijv(xAr(s)) < 
n,N-i(%N(s)) < C(so)e s l 2 and, therefore, we do not have 
to worry about this part of the real line. In what follows, we therefore assume 
that s £ 7o,JV U Ii t N- 

To show that A^ N goes to zero at the 2/3 rate claimed in (13), we just 
have to use Taylor's formula with integral remainder: we are going to choose 
fln t N such that the first-order terms [in Ai'(s)] that appear cancel out. Then, 
we will show that the remainder is 0(A r_2 / 3 e _s / 2 ). 

Using equation (A. 4), we have 

k u-\M x n{s)) = '— 1 + -e N r] N + 0{e N ) . 

&n-l,N V 5 / 
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Let us call 

Un-i,N(s) = i\!~i,nC{xn{s)) - s. 

While keeping in mind that itn-i,iV"( s ) depends on s, we will often drop the 
s to alleviate the notation. Now we use the fact that Ai"(x) = xAi(x) to get 
(through Taylor's formula with integral remainder) 

Ai ( K n-l,ArC(^(s))) = Ai(s) + U n -l >N Ai'(s) 

™n-l,jV 

(un-^N - y)(s + y)Ai(s + y) dy, 



and therefore, 

Oi n -l,ND n -i^N + &n,N-lD ni N-l 

= (a n -l,NU n -l,N + an,JV-l^n,AT-l)Ai / (s) 

+ a n _ ltN R N (s) + a n>A r_ii?iv_i(s). 

We start by focusing on (a n -i t NU n -i,N + otn,N-iu n ,N-i) and we will show 
later that Rn and Rn-i decay to zero fast enough for our needs. Since 
ot n -i,N = 1 + 0(1/N), it is clear that if we can show that Rn and Rn-i 
decay to zero at rate 2/3, Ou-i^Rn and a nt N-iRN-i will decay to zero at 
the same speed. 

1/2 

Before we proceed, we recall that a n -i t N = v / fl/vc n _i N&n,N / i^n,N- 
Remark on u n -\ : N- Using equation (A. 4), we have 

/ \ K n-l,N ~ , 2 _ r\t~2 \ 

U n -i tN {s) = —£ N 1 + -£nVn + O(ejv) 

0~n-l,N V 5 

where tjn has a finite limit. Recall that en = $n-i N + sa n n / 'ft n -l at, where 
6 n - 1 , N = 0(N- 1 ). So 

/ x «n-l,JV r , / &n,N -A 

«n-l,M s ) = d n _ijv + Is 

Cn-1,N \&n-l,N J 

2VN K n-l,N ~2 n f £%Kn-l,N 

H - e N + u 

t><T n -l,N V CTn— 1,JV 

Let us focus on fjv for a moment. It is clear that |ejv| < %iV — 2 / 3 ( | s | V 1), 
where x is independent of N, s and 7. It follows that 



2777V Kn-1, TV ~2 , rV ~3 / \ 

e 7V + V{£ N Kn-l,N/0-n-l,N) 



<x(M 3 vi). 



5oYi-l,iV 

The fact that x can be chosen independently of 7 is shown in Appendix A. 6. 
In other respects, we recall that, as mentioned in [15] for x > 0, 

|A „ N| Z 1 /4 e -2/3* 3 / a / 7 \ 
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from which we deduce, along the lines of the analyses we did before, that 



jV 2 /3 



2VNKn-l,N ~2 



e N + 0(ej V re n _i ) jv/o'n-i,Jv) 



|Ai'( S )|<C( S0 )e^ /2 , 



5a n -l t N 

C( S0 ) = {snp [sii00) [ e s / 2 - 2s3/2 /3 s i/4] + S u P[ _ 2 | S0 | :2si] e^ 2 |Ai'( S )|(| S | 3 V 1)}, for 
instance. Hence, C can be chosen to be a continuous and nonincreasing 
function; in this particular instance we could replace it by a constant. 
In other words, for our rates purposes, it is enough to focus on 



U n -1 



K n -1,N 



0~n,N 



l)a = 0((|s| Vl)^- 1 / 3 ) 



Note that the same analysis applies to u n 



N-l- 



An intuitive choice for fj, n ,N- At an intuitive level, our biggest problem 
comes from the "centering" problem, so it is natural to try to get rid of it 
by canceling its effect and then verify that we then get the rate we were 
expecting. 

The "centering" term, the one that appears because one time (i.e., for ip- 
related matters, or parameters (n — 1,N)] the "optimal" centering is jU n -i,jv 
and the other time [i.e., for ^-related matters, or parameters (n,N — 1)] it 

IS Hn,N-l, is 



K 



CN = OCn-XJSl- 



n-l,N 



(7) 



&n-l,N 



&n-l,N + Ct n ,N-l 



K n,N-l 
0~n,N-l 



0- n ,NV a N 



Canceling it leads us to choose 

1 



Da 



-1/2 
n.N-1 



+ 



1 



a 



1/2 



(centering) 



1/2 

n-l,N a n,N-l 
1 

Mn-l,iVCr n _i jv 



1/2 

Vn,N-ia nN _ x 



We will show in Appendix A. 1.2 that we indeed have jl n ^N — M«-i,Af = 0(1). 

Study of a n -i,JVUn-i,JV + ctn,N-iu n ,N~i- The conclusion of our remark 
on u n -i t N was that we can focus on u n -\,N rather than u n — i jv when dealing 
with the 2/3 rate. We have already seen that the choice made in (centering) 
led to 



(8) 



s N s 



a 



n-l,NUn-l,N(s) + an,JV-l%,JV-l(«) — CN 



a 



1/2 

ra-lJV 



a n N 



Mn-l,jV \a n -l,N 



1 + 



a 



1/2 

n,JV-l / a n M 



Mre.JV-l \a n ,N-l 



>s. 
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So a reasonable choice is to pick cr n .jv so as to cancel this term, that is, after 
defining 



ln,N 



1/2 

^n-l,jVQ- niA f_i 

1/2 : 
/%,Ar-l°n-l,iV 



(scaling) 



&n,N = (1 +7n, 



+ 



ln,N 



Cn-lJV &n,N-l 



Remark on the centering and scaling. Note also that by picking a a n jv 
such that the "scaling sequence" sat is an 0(A^ 2 / 3 ), and a An,iV that makes 
the "centering sequence" cn be 0(iV~ 2 / 3 ), we get that 

(a n -i,JVttn-i,Jv(s) + a ni Ar_iii ni AT_i(s)) = (|s| V l)0(Af~ 2/3 ) 

and we obtain the iV 2 / 3 speed for the original problem. 

Bounding the remainders appearing in the Taylor expansion. We first 
need to remark that 

OnI ltN , Un - 1>N (s)=0(s 3 N~ 1 / 3 ), 
On I , N , n n _ lj7V (s) = O(A^ 1/3 ) S0 . 

We use the notation O(-) S0 to emphasize the fact that the constant implicit 
in the O possibly depends on sq- These estimates also apply to Un,N-l- All 
the implicit constants can be chosen uniformly with respect to 7. 

On Ii t N, we have, if N is large enough, s + n n _i j 7v(s) > s/2. Hence, we 
get, using the fact that the Airy function is nonincreasing and positive on 
this interval, 



Rn(s) 



U n -1,N 



(un-i,N ~y){s + y) Ai(s + y) dy 

< (|u n _i iJV (s)| 2 /2)((s + u n - hN (s)) V s)Ai(s/2). 

Now ((s + u n -i,Ar(s)) Vs) = 0(s 3 ), and we therefore get, using only the 
control on N provided by u n -\^{s) 2 , 

N 2 / 3 R N (s)< X exp(-s/2). 

On the other hand, on Io t N, we have, for N large enough, — 2|so| < s + 
u T i-i,n(s) < 2si, and so we bound the remainder by 



Rn(s) 



U„-l N 



1JV 



y)(s + y)Ai(s + y)dy 



<(\u n -i )N (s)\ 2 /2)2( Sl V|s |) max |Ai(s 

[-2|so|,2ai] 
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Again, using the fact that it n -i,jv(s) = 0(N 1 ^ 3 ) S0 on this interval, we con- 
clude 

N 2/3 R N (s) < C(s ) exp(-s/2). 

Conclusion. The same analysis applies to Rn-±, and this finishes the 
proof of the fact that 

N 2 / 3 Al N (£ N (s)) < C(s ) exp(- S /2), 

with a choice of centering and scaling satisfying the conditions set forth in 
our remark on centering and scaling. 

4. Discussion. 

4.1. Centering and scaling. The proof confirmed the empirical findings 
in [11] that small changes to ji n .N and a nt N can drastically improve the 
quality of the Tracy- Widom approximation, and the relevance of the Tracy- 
Widom law in small samples. This is an important fact for applications. 

We recall the main conclusion of the analysis that we carried above: there 
is some liberty in choosing the centering and the scaling, as long as the 
chosen centering and scaling sequences (resp. fl nt N and d n ,N) satisfy the 
following properties [see equations (7) and (8) for the definitions of cn and 
s N ]: 

c N = 0(N~^), 
SN = 0(N~ 2 / 3 ) 

and fj, n>N - Hn-^N = 0(1), a njN /a n ^i jN = 1 + 0{N~ l ). 

We finally note that different choices of p, n< N and cr n jv (different from the 
ones indicated in Theorem 2, but satisfying the conditions just mentioned) 
might affect how the bounding functions behave with respect to 7. In other 
words, we might not be guaranteed that the bounding function M in The- 
orem 2 can be chosen to be independent of 7, but we are guaranteed that, 
for any chosen 7, the convergence is happening at rate 2/3. We give more 
detail on this issue in the next subsection and want to point out that with 
our choice of (l n ^N and a n ^ the bounding function M in Theorem 2 can be 
chosen to be independent of 7. 

4.2. The bounding function M . In Theorem 2, a bounding function M 
appears; it is important to know how it behaves. We need a few preliminaries 
to ease the discussion. 
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4.2.1. A remark on ||S||i(s). First, we need to understand the behavior 
of Hsu function of s, when S is viewed as an operator on L 2 ([s,oo)). 

Using the integral representation of S, we see that it is a positive sym- 
metric operator on L 2 ([s,oo)), and so we can evaluate, through Mercer's 
theorem, its trace class norm (see, e.g., [13], Chapter 30 and, in particular, 
Section 30.5, which can be easily extended to intervals of the type [a,oo) 
when the kernels have exponential decay properties at oo similar to that of 
the Airy operator). Using Airy integrals spelled out in [14], Al.ll(iv), we 
have for S, as an operator on [s,oo), 

/•OO /"OO 

~ / / Ai 2 (x + u) dxdu 
Jo Js 

= (-Ai(s)Ai'(s) - 2sAi'(s) 2 + 2s 2 Ai 2 (s)) /3. 

We can show that this quantity is unbounded as s — > — oo by using ele- 
mentary properties of the Airy function. As an aside, let us mention that 
||S||i(s) is of course a decreasing function of s. The order of magnitude of 
the previous quantity is |s| 3//2 when s — ► — oo. On the other hand, the speed 
of decay of the Airy function at oo shows that, on [so,oo), 

||S||i(s) < C(s )exp(-s), 

for a continuous and nonincreasing function C. (Its value may change from 
display to display, but it will always be a continuous and nonincreasing 
function.) 

Since ||G|| 2 = ||S||i/2, the previous estimate also holds for UGH 2 ,. 

4.2.2. Expliciting the bound. We deduce from Lemma 2 that 
2||S T - S||i < \\H T + G T — 2G|| 2 (4||G|| 2 + \\H T + G T - 2G|| 2 ) 

+ 1 1 H T — G T 1 1 2 • 

Making use of Lemma 3 and the bound we obtained for ||S||i, we get, for 
large enough and s on [so,°o), 

iV 2/3 ||S r - S||i < G( So )e- s/2 (5G( So )e- s/2 ) + G 2 ( So )e" s . 
Therefore, 

^ 2 / 3 ||S r -S|| 1 <G( So )exp(- s /2), 

for yet another continuous nonincreasing function G. We remark that if the 
G appearing in Lemma 3 grows polynomially as sq — > — oo, the G appearing 
in the previous display will also do so. 

Finally, combining the previous estimate with Lemma 1, we obtain, when 
N is large enough and s is in [so,oo), 

N 2 /\P(l n , N <s)- F 2 (s)) < C(s )e- s e 2+2 ^\ 
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where S is seen as an operator on L 2 ([s,oo)). Using the fact that ||5||i is a 
nonincreasing function of s, we finally get 

M(s ) = C(s )e 2+2 ^ s °\ 

4.2.3. Qualitative properties of M . The (continuous and nonincreasing) 
function C appearing in the course of the proof of Fact 2.2.1 is essentially 
controlled, especially when s — > — oo by the maximum of |Ai'| on [s,oo). 
This roughly behaves like |s| 1//4 when s goes to — oo. A careful look at the 
intermediate steps in the proof of Fact 2.2.1 shows that C can be chosen to 
grow polynomially when sq — > — oo. On the other hand, this function can be 
taken to be constant on [si, oo). As explained in Section 4.2.2, such remarks 
carry over to the choice of C made in the definition of M . 

Since ||5||i(so) is of order |so| 3//2 at — oo and appears in an exponential 
in the definition of M, we see that this part will be much more important 
in the growth of M than the function C(sq). It also appears that M grows 
when its argument goes to — oo and does it fast. 

Note that the bound for the Hilbert-Schmidt norm of the difference of 
the operators (Lemma 3) could be a little improved upon by using better 
estimates than exponential bounds for Ai and Ai' when their argument 
is negative and large. Indeed, the analysis presented to obtain Fact 2.2.1 
actually provides a polynomially growing bounding function P (of so) for 
the difference of the functions we are interested in on intervals of the type 
[— so,si]. In other words, we could replace C(so) exp(— s/2) by P(so) on 
those intervals P(so) growing polynomially to oo as sq — > — oo. The speed of 
growth of the function ||5||i(s) when s goes to — oo nevertheless makes such 
improvement of little importance. 

So our bounding function M(so) exp(— s) is a trade-off between two quan- 
tities. M grows — theoretically — very fast when so tends to — oo. On the 
other hand, it seems to not be too large for s nonnegative, or s negative 
but not too large in absolute value. Finally, the form of the bounding func- 
tion might shed some light on the empirical findings that the Tracy- Widom 
approximation is particularly good in the upper tail of the distribution. 

4.2.4. Role of 7 in the problem. The analysis we presented to obtain 
Fact 2.2.1 and through the previous two subsections shows that, given 7 
(the limit of n/N as they go to 00), we can find M 7 such that Theorem 
2 holds (with M 7 taking the place of M). We present in Appendix A. 6 a 
study of the dependence (with respect to 7) of the intermediate functions 
we obtained. It shows that these functions M 7 actually satisfy 

M 7 (s) < M(s)(l + 7~ 1/2 ) < 2M(s) < M(s), 

since 7 > 1. 
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Table 1 

Empirical quality of new centering and scaling sequence: simulations. The leftmost 
columns display certain quantiles of the Tracy-Widom distribution. The second column 
gives the corresponding value of its c. d.f. Other columns give the value of the empirical 
distribution functions obtained from simulations at these quantiles. The centering and 
scaling sequences are fi 



Quantiles 


TW 


1000 x 10 


4000 x 10 


10000 x 10 


5000 x 30 


4000 x 100 


2*SE 


-3.73 


0.01 


0.010 


0.014 


0.013 


0.010 


0.009 


0.002 


-3.20 


0.05 


0.048 


0.060 


0.058 


0.052 


0.046 


0.004 


-2.90 


0.10 


0.100 


0.113 


0.113 


0.103 


0.095 


0.006 


-2.27 


0.30 


0.300 


0.313 


0.312 


0.304 


0.295 


0.009 


-1.81 


0.50 


0.510 


0.502 


0.513 


0.501 


0.497 


0.010 


-1.33 


0.70 


0.707 


0.701 


0.704 


0.700 


0.696 


0.009 


-0.60 


0.90 


0.899 


0.904 


0.901 


0.900 


0.893 


0.006 


-0.23 


0.95 


0.949 


0.953 


0.952 


0.952 


0.947 


0.004 


0.48 


0.99 


0.990 


0.991 


0.990 


0.990 


0.989 


0.002 



In some sense this hides the role of 7, which might appear from the 
previous display to have little influence on the problem. This parameter 
nevertheless seems to have an important role in determining N(sq,^), that 
is, the integer after which our estimates are correct. The fact that, numer- 
ically, large values of 7 seem to yield an approximation of lower quality in 
simulations is likely to be a manifestation of the role of 7 in iV(so,7)- 

4.3. Simulations. Part of the impetus for this study was practical. We 
were wondering if it was possible to find new centering and scaling se- 
quences that would improve the quality of the Tracy-Widom approximation 
in "small" dimensions, a crucial need in Statistics and, more generally, for 
the relevance in applied fields of such approximations. We made some sim- 
ulations to see how the sequences we found and chose behaved in practice. 

Tables 1-3 were constructed (as in [11]) in the following way: we built n x 
TV matrices X, filled with i.i.d. standard complex Gaussian entries. We made 
10,000 simulations for each matrix size, extracting the largest eigenvalue of 
X*X, recentering and rescaling it through p, n ,N and & TIi n- The quantiles 
of the Tracy-Widom law W2 are courtesy of Professor Iain M. Johnstone. 
The columns corresponding to matrix sizes give the value of the empirical 
distribution function we found for at the Tracy-Widom quantiles. If the 
approximation were "perfect," all the columns would be equal to the second 
column starting from the left. 

The conclusion we can draw from these simulations is that the approx- 
imation is remarkably good in the upper tail of the distribution (as also 
remarked in [11]; this is an excellent property to have for one-sided tests in 
Statistics) and that the new sequences seem to improve the quality of the 
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Table 2 

Empirical quality of new centering and scaling sequence: simulations. The data was 
generated as in Table 1. We experiment with (increasingly) large 7 



Quantiles 


TW 


5000 x 50 


20000 x 


50 50000 x 


50 200 x , 


5 2000 x 5 


20000 x 5 


2*SE 


-3.73 


0.01 


0.011 


0.011 


0.012 


0.008 


0.014 


0.014 


0.002 


-3.20 


0.05 


0.052 


0.050 


0.056 


0.042 


0.058 


0.066 


0.004 


-2.90 


0.10 


0.101 


0.105 


0.108 


0.089 


0.109 


0.121 


0.006 


-2.27 


0.30 


0.301 


0.308 


0.314 


0.293 


0.312 


0.327 


0.009 


-1.81 


0.50 


0.500 


0.498 


0.509 


0.502 


0.509 


0.523 


0.010 


-1.33 


0.70 


0.706 


0.700 


0.703 


0.703 


0.714 


0.711 


0.009 


-0.60 


0.90 


0.903 


0.904 


0.903 


0.904 


0.904 


0.904 


0.006 


-0.23 


0.95 


0.951 


0.951 


0.953 


0.956 


0.953 


0.952 


0.004 


0.48 


0.99 


0.990 


0.991 


0.990 


0.991 


0.991 


0.991 


0.002 










Table 3 










Empirical quality of new 


centering and scaling sequence: simulations. The data 


was 


generated 


as in Table 1. We look at 


relatively small matrices and 7 quite 


small (1 and 4) 


Quantiles 


TW 


5x5 


10 x 10 


100 x 100 


20 x 5 


40 x 10 400 x 100 


2*SE 


-3.73 


0.01 





0.001 


0.006 


0.001 


0.003 


0.007 


0.002 


-3.20 


0.05 


0.002 


0.012 


0.041 


0.017 


0.026 


0.041 


0.004 


-2.90 


0.10 


0.012 


0.042 


0.084 


0.049 


0.065 


0.089 


0.006 


-2.27 


0.30 


0.168 


0.228 


0.284 


0.232 


0.260 


0.285 


0.009 


-1.81 


0.50 


0.412 


0.454 


0.491 


0.452 


0.472 


0.490 


0.010 


-1.33 


0.70 


0.669 


0.682 


0.695 


0.684 


0.691 


0.688 


0.009 


-0.60 


0.90 


0.903 


0.903 


0.902 


0.906 


0.903 


0.894 


0.006 


-0.23 


0.95 


0.952 


0.952 


0.951 


0.953 


0.952 


0.948 


0.004 


0.48 


0.99 


0.992 


0.989 


0.990 


0.992 


0.991 


0.989 


0.002 



approximation over the so-far standard choices, especially in the cases where 
n/N is quite large. Those are among the most interesting from a statisti- 
cal point of view, as they are often encountered in "neo-classical" settings 
(n quite large and N moderately large) and they are situations where the 
rapidly developing theory of large random covariance matrices provide an 
alternative to the classical theory (see, e.g., [1], Chapters 11 and 13 and, in 
particular, Theorem 13.5.2) of multivariate statistics. 

APPENDIX 
A.l. Properties of r N , p, n ,N, &n,N and a n , N - 
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A. 1.1. Study o/rjv. We are interested in asymptotics for r^. Recall from 
[11] that 

2 _ 27TK 2 x N e 2KNCa 



Nl(N + a) 



Now K 2 x N e 2KNC0 = e- 2KN nl + N+ + = e - n +~ N +nl + N+ + . Stirling's formula gives 



n 



, =e - v+ v^f 1+ _i. + _i_ + ori 



Rewriting r%, we get 



r 2 



Let us therefore focus on g(n) 
we get 



-. Applying Stirling's formula, 



^n)=e-^(l + -\ (l-x n + x 2 n + o(x 2 n )), 
with x n = l/(12n) + l/(288n 2 ). So if we go to second order, we have 



1 + 



1 \ n+0.5 



2n 



+ 



7 



1 



8n 128n 2 



+ 



1 



1 



1 



l + x n 

From this we conclude that 



+ 



1 



Yin 288n 2 



+ ^ 



9W 



1 + 



1 



71 



and finally, 
(A.l) r N = l + 



1 /l 1 



+ T7 + 



24n 1152n 2 
1 285 



+ 0(- 



48 Vn N 2304niV 4608 



1 1 

n 2 + iV 2 



+ o 



1 1 



A. 1.2. Properties of the centering sequence. The aim of the discussion 
that follows is to show that, as we announced in the text, we have 

An.AT - Hn-1,N = 0(1) and fin,N- fhi,N-l = 0(l). 



1/2 

PROOF. Let us first remark that one can write, with a = cr n _ 1 N and 



b = a 



1/2 

n,JV-l' 



1 1W 1_ _L 
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so that 

a/v_i jv(/%jv-i — Afn-i,iv) 

fJ-n,N — Vn-1,N = 7 \ 

OfJ, n ,N-l + a/J, n -l,N 

and 

bHn,N-l(^n,N-l ~ Mra-l.iv) 



Mn,JV — Hn,N-l 



bfJ>n,N-l + a/^n-l,JV 



Hence, using the fact that apL n -i,N / '(fy-*n,iV-i) — ► 1> we just need to show 
that 

Mn— 1,2V - MrijJV-l = 0(1) 

to have the announced property. This is an easy task, if a little tedious. To 
carry it out, we study 

-u n+aiA r +/ 3 = (V^ + a + VN + 13 f. 

Expanding the square and using the Taylor expansion of (1 + a;) 1 / 2 around 
0, we have 

u n+a n+ r = n + N + a + (3 + 2VnN + Vn~N (- + 

\n TV 

+ "IT" v^iV " 2 U 2 + iV 2 J J + v^72~ ' 7V572" J ; 
if we already account for the fact that n x N. Since 

Mn— 1,7V = 'U n _o.5,/V+0.5 an d /%,/V-l = ^n+0.5,/V-0.5> 

we finally obtain 



/•in- 1,2V - Mn,JV-l = V T7 - V — + O 



iV\ „ / iV 1 / 2 n 1 / 2 



V 



iV V n ) V n 5 / 2 ' N 5 / 2 y 
Since n x iV, — y^) is bounded and we have shown that 

Mn-l,JV-/VV-l = 0(l). □ 

A. 1.3. Properties of the scaling sequence. We are going to show that 

1 = 0(KI) and 



o- n -l,N \n N 

(Behavior a n ^) 



a- n ,N-i \n N 
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Proof. We remark that one can formally write that, with obvious sub- 
stitutions, 



^ = c=(i+ 7 )(i + 2) 



Note that in a, b and 7 are nonnegative in the equation defining a n ^N in 
Theorem 2. Simple algebra leads to c — a = 70(6 — a)/(b + 07) and c — b = 
b(a — b)/{b + 07), from which we see that 

aAb<c<aVb. 

Hence, to show the properties stated in the equation (Behavior cr^ jv), it is 
enough to show that 

^-1 + ofii 



Vn,N-l \n N, 

This is of course not a surprise, since when we do a Taylor expansion of all 
the constituting parts of a n -i,N or a n> N-l, the first-order terms are of order 
1/n or 1/N. Formally, if we call 



_ Vn + a + y/N + (3 

r n+atN+p - ^ + ^_ 1/2 + ( ^ + ^_ 1/2)1/3 > 

we have (as remarked in [11], display preceding equation (A. 8)) 



3 \+a,JV+/? 



^(n + a)(iV + /3) 



We are interested in ratios of the type T n+a ^ + p/r n+ol i ^ N+ pi ^ and so we can 
focus on 



■ n + a ,N+f3 \ 3 _ f l u n+atN+f3 - u n+a ^ N+p > \ 2 y/N + (3'Vn + a' 



1~n+a>,N+f3' J V U n+a i tN+/3 f J N + (5yjn + a 

The only case that matters to us is a = — 1/2 = — j3 = —a' = (3' . Using the 
work we did on pL n -i,N — ^n,N-i and simple Taylor expansions of [(1 + 
x)/(l — a;)] 1 / 2 , we have easily (after we incorporate n x N) 



^-,^ 3 _/- 1+ ^-^ +0 /nVY 1+ vi + lv ^ 



cr ni N—lJ V n + N \n 2 ) ) \ 2\n Nj \n 2 

We deduce that indeed 

and the property is shown. □ 
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A. 1.4. Control of first term appearing in ip T . The aim of this last part 
is to show that the a n -i t N introduced in Section 3.4 has the property that 

y/aNO-l/^ 1N a ntN ( 1 \ 

a n -i,N = = — ' = 1 + O — . 

Proof. The proof will also show that a n ^-i has the same property. It 
is very simple: 



«n-l,7V 



Hn,N 

/n-3 \ 1/2 ~ 

a/4( a n-l,N \ ' Q~n,N H-n-l,N 



(nN) 1 

((n - l)iV) 1/4 ( %^ VZ ^AL (1 _ 1/n) -l/4 

V^n-lAT/ °~n-l,N Mn,iV 



r 3 , ,AV2 * 



(1-1/n)" 1 / 4 . 



0"n-l,JV Atn.JV 

Since this is the product of terms which are all 1 + 0(l/iV), we have 

a„_i jv = 1 + 0( — ). 



A. 2. A closer look at £ and /. Recall that we are interested in asymp- 
totics for xn(s) = fin,N + & nt NS, where fA ni N and d Ui N are the centering 
and scaling sequences we chose. So in general, we can write x^{s)/kn = 
£2 + £at(s), and for good choices of fl nj N and <7 n ,7Vi £n{s) 1S going to be 
small provided s is not too big. The rationale for setting up the analysis 
like this is that, to obtain a finite upper bound for the trace class norm of 
our operators, we use a "variable split" of [so>oo), involving an interval of 
the form [si,n p ' q sx], so we need a uniform control of £ over this type of 
intervals. [In our applications, iN(n p / q si) is small with respect to 1.] 

Note that we present asymptotics with en(s) > 0. If £/v( s ) < 0, the same 
results hold when we use 2/3(— C) 3 ^ 2 = Jif 2 { — fit)} 1 ^ 2 ^. So we can safely 
apply the estimates that follow without worrying about the sign of e~n{s). 
Finally, let us mention that we will often write en instead of en(s) for the 
sake of readability. 



A. 2.1. Asymptotics for £. For the sake of simplicity, we will use the 
notation xn(s) (resp. en) instead of xn(s) (resp. e~n) in what follows, even 
though we will be working with fl ni N and a n< N- These centering and scaling 
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sequences are generic sequences in this subsection. They just guarantee that 
£tv( s ) = x n(s)/kn — £(2) is "small." 
We have 

-C^ 2 (x N (s))= / g N (t)dt 



3 



£2 

(t-£ 2 ) 1/2 (t-ei) 1/2 



where fl'Ar(i) 



2/- 



Changing variables to y = (t — £2)/^ , and denoting «tv = £2 — £1 = 2y 4 — wjy, 
% = 6 = 2 + ^4-lo 2 n , we get 



3 2/?at Jo l + ye N /(3 N 

Now since a at and @n have finite nonzero limits as TV — ► 00, ye^ /o-n and 
ye^/PN stay small with respect to 1. Hence, we can do a Taylor expansion 
within the integral and integrate it. We get 

§^w>=^(§4»(^-£) + °<*>> 

In other words, if we write 

Vn = l/(2a N ) - l/(3 N , 

we have 

3/2 1/2 

(A.2) C 3/ W*)) = £ -^~ (l + jW + O(^)) . 

A. 2. 2. Asymptotics for f. First we have = so 

Q.]sj£N l + £ N /aN 
4/3^ (1 + e N /(3 N y 

from which we deduce that 

r 3/2 (6 + ^) = (^)~ 3/ V - **v» + o(^)]- 

We recall that the quantity we are primarily interested in is (kat/ct^ n) 1 ^ f -1 ^ , 
with / = //C- Combining the two aforementioned Taylor expansions, and the 
fact that /-V4 = (C 3/2 / _3/2 ) 1/6 , we get 



kjv\ V Vi/4 { Wn"n \ 1/6 {, 2 
""5~ / = — 1 - k £ JV»7jv + O 



(4) 
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We know from [11], displays (A.7) through (A.8), that 4 f N f" = a^ N . There- 
fore, we conclude that 



(A.3) r ^=^i-- £Nm + ^ N) j. 

2/3 

A. 2. 3. Asymptotics for £(xn(s)). Using equation (A. 2), we get 

i41 3 ({xn(s)) = e N n N (-Tfir^r) ' ( 1 + ^£NVN + 0(e 2 N ) 

(A.4) 



0~ n ,N 



1 + 7 £ iV?iv + O(ejy) 



A.3. A note on the continuity and the variation of v(uj,C). We first 
remind the reader that in order to control the error that the perturbation of 
the differential equation induces on its solution, Olver ([15], Theorem 11.3.1, 
page 399) introduced an error-control function: the variation of the function 
H, where H(x) = Jq y~ 1 ^ 2 v(uj,y), where v is the perturbation function of 
the differential equation. In our case it depends on a parameter, uj = 2X/k, 
which is not the case in [15]. The issue at stake is as follows: is the error- 
control function uniformly bounded on the interval where uj varies? Olver 
shows that it is finite point by point, but we need to be sure that it is 
bounded on the interval [0,2 — 5], 5 > 0, to make our rate estimates work: 
this is what allows us to neglect the term Ei that appeared in the analysis, 
because if the error control function is indeed bounded, then e-i is 0(l/iV). 
To explain the strategy, we need to jump a little bit ahead of the proof. 
Our aim is to find an interval I w (for £), possibly depending on uj, where we 
will be guaranteed to be while doing our asymptotic developments, and for 
which we have 



\u\ 1 I 2 \v{uj, u)\ du < x. 



where x does not depend on uj. 
Before proceeding, recall that 

f(",0 = ^ = 

1 

We warn the reader that we drop the dependence of / on uj temporarily. 
Also, 

, » 1 d 2 {f l ' A ) g 1 d 2 (/~ V4 ) 9 
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As mentioned above, the aim of this subsection is to explain why the afore- 
mentioned integral of |y|" 1//2 |f| remains bounded when lo 2 £ [0,4 — 5]. Note 
also that our lo is in [0, 2 — p\. 

From [15], Lemma 11.3.1, we know that the integral is finite for any given 
lo < 2. Our point here is to give a simple argument that justifies why this is 
also true uniformly on this interval. As a first step toward the resolution of 
the problem, we want to show that, 

roo 

(Rl) for lo 2 E [0,4-5], / \y\~ 1/2 \v{uo, y)\ dy < oo. 



o 



Recall also that, for x > £2 = 2 + a/4 — oo 2 , 

§C 3/2 =f /M) 1/2 *, 

so it is clear that £(tj,£) is continuous in these two variables. Moreover, if 
£ > 4, £(cj,£) is an increasing function of lo > 0. 

Now using, for instance, (El) and expanding the formula in terms of g, £ 
and / and its derivatives, one can see that, at +00, 

and that we can actually find a £1, independent of w S [0, 2 — p] but possibly 
dependent on p, such that, for £ > £1, 

1^,01 <C 2 - 

So if we can show that £) is continuous on [0, 2 — p] x [0, 00), the result 
(Rl) follows. 

Now recall that Olver in [15], Lemma 11.3.1, shows that, at co fixed, 

f f(^Q 2 J3 \~ 2 / 3 

/(£) = — * — = (?(<»,£)) jT^'Oj 

b(«. = (e - a) 1/2 /(2e), and g ( w , a = (e - &r 3/2 4 (* - £ 2 ) 1/2 ^, *) 

is a well-behaved function of £, and furthermore, that £) is continuous 
on intervals of interest. The dependence of p and q on cj comes from the fact 
that both £1 and £2 are functions of lo. Note that in the proof of Lemma 
11.3.1 in [15], Olver shows that as £ — ► £21 <?(w, £) — ► (2/3)p(cj, £2(0;)). Since u; 
is bounded away from 2, £2) is bounded away from on [0, 2 — 6] x [2, 4] . 
As a matter of fact, £2 (to) — £i(w) is bounded away from (uniformly in cj, 
when u £ [0, 2 — 5]). Furthermore, on intervals of the type [so, 00), which are 
the ones we are focusing on for our original rate of convergence problem, 
the corresponding £'s remain bounded away from 2, for large enough N, 
because £2^) is uniformly (in lo) bounded away from 2. This is what makes 
/(w,£) and l//(a;,£) well-behaved functions on [0,2 — p] x [2, 00). As we just 
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mentioned, the interval [2, oo) for £ is more than enough for our purposes, 
the leftmost our £'s can go is close to inf^ £2^) an d this quantity is bounded 
away from 2. 

With this information about /, we can show from equation (El) and along 
the lines of the arguments given in Lemma 11.3.1 in [15] that v(uj,£) is a 
continuous function in its two variables. Since £(u>, £) is invertible, continuous 
and has a well-behaved inverse, we conclude that v(u,£) is continuous on 
[0,2 — p] x [0, Hence, we deduce that 

/•OO 

3 X s.t. Vu£ [0,2- p] / \y\- 1/2 \v(u>,y)\dy< X . 

Jo 

The problem is not finished because, for s negative, £ will cross 0. But we 
have already seen that the £'s appearing in our analyses will never corre- 
spond to £'s that are less than 2. With this in mind, calling £2(0;) the £ that 
corresponds to £ = 2, we pick I u = [(^(w), 00). Let us now show that this is 
a satisfactory choice. 

The continuity arguments we just mentioned can also be used to show 
that if £2(0^) corresponds to £ = 2, we have 

3 X s.t. VcuG [0,2-p] / \y\~ l / 2 \v{uj,y)\dy<x, 
from which we finally deduce what we needed: 

3 X s.t. VcjG[0,2- P ] F(u)= \y\~ 1/2 \v(u, y)\ dy < X - 

A.4. Proof of Lemma 2. Recall that S" r = H T G T + G T H T} S = 2G 2 . The 
key inequality in what follows is ([9], Lemma IV. 7. 2, page 67) 

< ||A|| 2 ||£|| 2 . 

Our problem is to control \\S T — S\\±. Note that 

_ (Hr + G T ) 2 - (H T - G T ) 2 

2 

Let A = H T + G T and B = 2G. We have, of course, 

A 2 2 _ (A + B)(A-B) + (A-B)(A + B) 

2 

and hence, 

\\A 2 - B 2 \\i < ^{A- B\\ 2 \\A + B\\ 2 . 
We can therefore conclude that, since S T - S = (A 2 - B 2 - (H T - G T ) 2 )/2, 
2 1 1 S T — S\\i < || H T + G T — 2G 1 1 2 1 1 H T + G T -I- 2G 1 1 2 + 1 1 H T — G T \ \ \ , 
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which is the claim made in Lemma 2. 

Since we can show that H T — > G at rate at least 1 /3 and G T — > G at rate 
at least 1/3 too, it is enough to choose the centering and scaling so that 

N 2/3 \\H T + G T - 2G\\ 2 < C(s ) exp(-s/2), 
in order to get the 2/3 rate. 



A. 5. Simplified expressions for <p and ?/> and a remark on ct^r = 0. We 
present in this subsection the derivation of the simplified expressions for 
(p and ip. These are simple algebraic manipulations, but they greatly simplify 
the rate work. We also explain why the case ocn = does not create a specific 
problem. 



A. 5.1. Case of if. Recall that, by definition, we have 

<p(x) = (-if^lVNT^^ix) - ^ N -x(x)}, 

with £fc(x) = (fk(x;a)/x and (pk(x;a) = yjk\/(k + a)lx a / 2 e~ x / 2 L^(x). L% is 
of course a Laguerre polynomial, as defined, for instance, in [19], Chapter 5. 
Rewriting the previous expression, we get that, if we call v n = (— 1) N y/o-N /I, 

v N x a l 2 - x e- x l 2 \ ^/N+~^\ 



Nl 



(N + a 



-L%(x) - VN 



(N-iy. 



(JV-l + a)! 



L 



N-V 



v N x a l 2 - l e- x ' 2 < 



Nl 



■[Ln-L%- 1 ]\=< P {x). 



\ \ (N + a -1)1 

Using formula (5.1.13) in [19], we have Lf^ — = L^f . Hence, 

x a/2-l e -x/2 [L a _ £ a _^ = x ( a -l)/2 e -x/2 L (*-l) x ~l/2 _ 

We can therefore conclude that 

^x) = (-l) N ] ^^ N (x ] a-l)x^ 2 . 

A. 5. 2. Case ofip. In this case, we have, by definition, 



V 2 

and after expanding £tv and £jv_i just as before, we get 



#c) = (-l) 



N fa~Wx a l 2 e~ x l 2 UN -1)1 



(N + a)l 



{NL%-(N + a)L a N ^}. 
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According to [19], formula (5.1.14), {NL%-(N + a)L%_ 1 }/x = -L ( fi±l ) , 
from which we deduce that 

J.( T \ _ / ^(N-l) [ON (a+l)/2 -x/2 I (N - 1)\ +1 1/2 

= (-l) (7V - 1) ^^i(x;a + l)x^/ 2 . 

A. 5. 3. Case a at = 0. This situation might seem a little problematic since 
<3!7v — 1 appears in the definition of ip. Note, nevertheless, that going back to 
the definition of <p and ip (before manipulations), we have ip = ip when = 
(or n = N), and both are well defined in terms of Laguerre polynomials 
Ljy and LP N _ V So by using the expression obtained for ip for both ip and 
ip in that case, we realize that the case oin = does not create any further 
complications. 

A. 6. Dependence of error bounds and s\ on 7. First a notational point: 
we now make explicit the dependence on 7 of the intermediate bounding 
functions we obtained by subscripting them with 7. We will give a sketch 
of the proof, outlining the reasons for which we can bound M 7 by M(l + 
7" 1 / 2 ), as claimed in Section 4.2. The work is divided into three parts, and 
everything refers to Section 3.5. We first control n (xn(s)), then turn 
our attention to quantities of the type S n _i i Ar(xiv(s)) and in the last step 
explain why we control \\H T — G T ||2- All the work is done on io.iv Ui^jv, since 
h,N does not pose any difficult problems. Finally, we focus on large values 
of 7 since the case 7 uniformly bounded leads directly to uniform bounds on 
M 7 (nothing prevents us from choosing M 7 to be continuous with respect 
to 7). 

A.6.1. About A% N (x N (s)). With our choice of j2 n ,N and a n ,N, we have 
cn = sn = so a n _i i jv'Sn-i,iv( s ) + a n,N-lUn,N-l( s ) = 0. So to control 

(a n -i,NU n -i,N(s) + a n> N-iUn,N-i(s))Ai' (s) 

in terms of 7, we essentially just have to control 

e N {s) + 0{e N {s)K n - 1:N /a n - 1:N ) 



Elementary manipulations show that fc n -l,jv ~ (1+7)-^, 0"n-i,iV x 7 1 / 2 iV 1 / 3 , 
VN < V 1 + 7) and Vn-i,N ~ Mn,JV-i - 7 1/2 - 7~ 1/2 - By aggregating all this 
information, we conclude that 



2??JV^n-l,JV ~2 / > 

e vv s , 



5cr re _i jv 



<x^ 2/3 (kl 2 vi), 
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where x is independent of 7. 

One easily deduces from this that 



jy2/3 



(A.5) 



e N (s) + 0(e JV (s)«; n _i i Ar/cr n _i ) jv) 



< C(s )e- S / 2 



Ai'(s) 



On the other hand, the end of Section 3.5 makes very clear that all we 
have to do in order to bound Rn(s) and Rn-i(s) is understand |u n _i i 7v(s)| 2 . 
Since we just shown how to control u n -i,iv(s) — u n -i t N(s), we just need to 
bound u n _i > jv(s)- By definition, we have 



\u n -i,N(s)\ 



£jv(s) - S 

0~n-l,N 



Using the aforementioned asymptotic properties of a n -.\^ and fin-i^ 
[i n ^N-i, we easily obtain our final estimate: 

(A.6) \u n -l,N( s )\ < xN~ 1/3 (\s\ V l) 1 ^ < XA^ 1/3 (M V 1). 

7 

Combining equations (A.6) and (A.5), we finally get 
N 2 / 3 Al N (x N (s))< X C(s Q )e~ s / 2 , 
a bound that is independent of 7. 



A. 6. 2. About B n -i^(x]y(s)) . The key to having a good handle of this 
quantity is of course to be found in Section 3.4.1. We see that we just need 
to control ^a^aK 5 ) s &n,N / 'An,JV '■ Using the same estimates as before, we 
easily get 



r] N e N (s) + s 



0~ n .N 



fJ-n,N 



<iY- 2 / 3 x(|s|Vl)7~ 1/2 , 



which implies that 

N 2 ' 3 B n ^ N (x N (s)) < C(s )e^ 2 r 1/2 - 
We can therefore conclude that 

N 2 / 3 \<p T (s) + Ms) - >/2Ai(a)| < C(s )e- S / 2 (1 + 7 " 1/2 ) 
and we have now made explicit how C 7 depends on 7 in equation (2). 
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A. 6. 3. About (p T — ip T . Working on \<p T (s) — Ai(s)/\/2| is now quite simple 
since we just have to re-use the estimates we just obtained. We conclude that 
the dependence of C 7 on 7 is the same in equation (3) as it was in equation 
(2). We also get the same result for \ip T (s) - Ai(s)/ V / 2|. 

The combination of these three results imply that the bounding function 
in Theorem 2 has the property 

M 7 (s) < (l + 7 - 1/2 )M(s). 

Since 7 > 1, our bounding function can be chosen to be independent of 7. 



A. 6. 4. About si( 7 ). We mentioned in the course of the proof of Fact 
2.2.1 that we could choose s± independently of 7. Recall that si is chosen as 
in [11], A. 8, page 325. It is defined there as, at fixed 7, 51(7) = c( 7 )(l + 5), 
with 5 > 0. Recall that 

C ( 7 )=lim ^ - 161/2 



»-~<»(6-£i) 7 73/247-1/2 ' 

Since 0(7) is a continuous function of 7 on [l,oo) having a limit at 00, it 
is bounded. Hence, the same s% can be chosen for all 7's. 
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